A Bounded Rational Driver Model 
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This paper introduces a car following model where the driving scheme takes into account the defi- 
ciencies of human decision making in a general way. Additionally, it improves certain shortcomings 
of most of the models currently in use: it is stochastic but has a continuous acceleration. This is 
achieved at the cost of formulating the model in terms of the time derivative of the acceleration, 
making it non-Newtonian. 



To understand traffic flow, it is mandatory to analyze 
the interaction between the cars. The simplest case is 
that of a car following a lead car. To describe this pro- 
cess, a big number of models have been invented (for a 
review see [0, |||). These models differ in the details of 
the interaction between the cars, and the time update 
rule, ranging from differential equations to cellular au- 
tomata. Mostly, they describe this process by an equa- 
tion a = a(v, h, V) that relates the change in the current 
velocity v (the acceleration a) to the velocity v of the fol- 
lowing car, the distance h ("headway") to the car ahead, 
and its speed V, respectively. 

Considerable effort has been invested to investigate 
the emerging macroscopic behavior from the underlying 
microscopic dynamics of interacting cars. Nevertheless, 
there is still a lot of controversy in both the macroscopic 
behavior when compared to reality ||, and in the mi- 
croscopic foundations of the individual car dynamics. In 
particular, the observed non-damped oscillations in the 
relative motion of vehicles, which are illustrated in Fig. |l| 
are often explained by the instability in the cooperative 
motion of the car ensemble only (see, e.g., [Q, ||]). In 
fact, subjected to reasonable physical constraints the re- 
lation a = a(v, h, V) seems to be hardly able to predict 
an instability in the following car motion provided the 
car ahead moves at a constant velocity. However, recent 
models j|, [|, ^ display a certain kind of instability in the 
car following process itself. 

There are actually two stimuli affecting the driver be- 
havior. One of them is the necessity to move at the mean 
speed of traffic flow, i.e., with the speed V of the lead- 
ing car. So, first, the driver should control the velocity 
difference v — V. The other is the necessity to maintain 
a safe headway h opt (V) depending on the velocity V. In 
particular, the earliest "follow-the-leader" models |t], ||] 
take into account the former stimulus only without re- 
garding the headway h at all. By contrast, the "optimal 
velocity" model || [To| directly relates the acceleration 
a to the difference between the current velocity v and 
a certain optimal value i9 pt(/j) at the current headway, 
a oc [v — # pt (/*)]■ Of course, more sophisticated approx- 
imations, e.g., Jul fi^, [H| |l4], |ll| [l6| to name but a few, 
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FIG. 1: Measured car-following behavior. Data are recorded 
by an equipped car measuring distance h and speed v and 
computing v — V during a drive on a German freeway. 



allow for both stimuli. 

It is not very likely that the variables {v, h, V} do spec- 
ify the acceleration a completely. Since drivers have moti- 
vations and follow only partly physical regularities, mem- 
ory effects may be essential. In a simple manner, this has 
been introduced in models that relate the current accel- 
eration a(t) to the velocity v(t — r a ) and the headway 
h(t — T a ) at a previous moment t — r a (for a review of the 
"following-the- leader" models sec, e.g., Ref. 
the "optimal velocity model" see Ref. Q). 
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the delay time in the driver response which is treated as a 
constant. This approach is not completely satisfactorily, 
since first, the physiological delay in the driver response 
seems to be too short to be of importance. Second, it 
is not clear why the memory effects relate only two mo- 
ments of time instead of a certain interval as a whole. 
Third, the dependence of the time scale T a on the car 
motion state is missing. Nevertheless, these models show 
an instability in the car-following dynamics (provided r a 
is big enough) and are non-Newtonian as well. 

In the following, reasons of another nature than the 
driver response delay lead beyond the framework of New- 
ton's mechanics. A corresponding model for the following 
car dynamics displaying an instability around the sta- 
tionary motion is proposed. To describe the driver be- 
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havior, the approach suggested in Ref. M] will be used. 
There, drivers plan their behavior for a certain time in 
advance instead of simply reacting to the surrounding sit- 
uation. A similar idea related to the optimum design of a 
distance controlling driver assistance system is discussed 
in Ref. (2lJ . In mathematical terms the driver's planning 
of her further motion is reduced to finding extremals of 
a certain priority functional that ranks outcomes of dif- 
ferent driving strategies. Here, the assumption that the 
driver is rational plays the crucial role. It means that the 
driver continuously correct the car motion to follow the 
optimal strategy. In this case p2| , the collection of vari- 
ables {v, h, V} does specify the car acceleration a com- 
pletely. However, the assumed continuous control is im- 
possible to achieve for humans. Therefore, it is assumed 
below that a real driver, first, cannot compute the opti- 
mal path of motion exactly and, second, that she cannot 
correct the car motion continuously. 

This is just the approach that is known as bounded 
rationality [^3f . Even if a driver succeeds in finding the 
optimal solution, she is only capable of setting the accel- 
eration to a fixed value. After that, she waits until the 
deviation from her priority functional has become too big 
to ignore, leading to a re-computation of another more 
or less optimal path. Or, to put it differently, drivers are 
simply not capable of resolving small differences between 
a given value of acceleration, speed, or headway and their 
"optimal" desired values. 

The re-computations are assumed to happen stochasti- 
cally, with a probability that increases with the deviation 
from the desired state. So, the model described below 
becomes a stochastic one. The action of noise can be 
modeled either explicitly by introducing certain thresh- 
olds (as is done in the psycho-physical models |l^] ) or by 
making the noise amplitude dependent on the distance 
between the current and optimal state. This defines a 
dynamic trap model 24 , an approach that will be fol- 
lowed below. 

To make the model more realistic, it is demanded that 
the trajectories of acceleration, speed, and headway are 
continuous functions of time. This can be achieved by 
formulating the model in terms of the time-derivative of 
acceleration called jerk and adding a white-noise term 
there. In what follows, that the acceleration is a col- 
orized noise process without jumps, and so are the other 
integrals of motion (speed and headway, respectively). 



MODEL DESCRIPTION 
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FIG. 2: The driver strategy of governing the car motion. 

the constant acceleration as the optimal one. 

A certain time interval r a later, the driver has to cor- 
rect the car motion again. This can be done by shifting 
the current acceleration a(t + r a ) towards the desired op- 
timal value a opt (t + T a ) = -<9 t 2 f) opt (t, t + T a )\ l=t +r a known 
to her approximately: 

a(t + T a ) - a(t) = C {a opt (t + r a ) - a{t)) + a lnd (t + r a ) , 

where C < 1 is a constant about unity and the ran- 
dom term a Tnd (t + r a ) allows for the uncertainty in the 
driver evaluation of the optimal acceleration at the cur- 
rent time. Its mean amplitude a c characterizes physio- 
logical properties of drivers and can be considered con- 
stant. Thereby, (a Tnd (t) ■ a Tnd (t')) = a 2 S t . t ' , where 8 t . t > is 
Kronecker's delta. 

This discrete representation of the car motion correc- 
tion is converted to a continuous description based on 
stochastic differential equations. Namely, the above dis- 
crete governing equation is reduced to 

^ = --(a-a op t(h,v,V))+rt(t). (1) 
at r a 

Here, a op t(/i, v 7 V) is the optimal acceleration specified by 
the current values of headway, car velocity, and leading 
car velocity. The term £(i) is white noise of unit ampli- 
tude which models the uncertainty in the driver evalua- 
tion of the optimal motion. 

The acceleration increment Sa caused by the random 
force r?£(i) acting during the time r a is actually the ran- 
dom component a rnd (t) entering the discrete governing 
equation. Thus, it follows from the estimate ((Sa) 2 ) ~ 
rj 2 T a that 
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(2) 



Assume that at a certain instant of time t the driver 
has decided to correct the car motion and chosen the 
acceleration a(t) (Fig. ||). As discussed above, the opti- 
mal path {f) pt(t, t)} of the further motion (t > t) is too 
complex for her to compute and to follow it. So, she re- 
gards the path {f) a (t, t) : a(i,t) — a(t)} characterized by 



The time scale r a of the driver control over the car mo- 
tion depends on the state (h, v, V, a). Thus, the stochas- 
tic differential equation ([!]) contains multiplicative noise. 
So its type with respect to the corresponding Fokker- 
Planck equation has to be specified. The adopted driv- 
ing strategy (Fig. |^) implies that all the characteristics 
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of correcting the car motion are determined by its state 
at the "terminal" point t + r a rather than at the "initial" 
point t. Therefore, it is reasonable for Eq. (|l|) to be of 
Klimontovich type or, according to the classification in 
p5[ , to describe a "postpoint" random process. 

To complete the model, a op t(h, v, V) and T a (h, v, V, a) 
have to be specified. The simple ansatz 



a op t(h,v, V) 
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(v-V)--g h (h~h v ) 



(3) 



is well justified, at least, near the stationary state of the 
car motion, v — V and h — hv ■ It should be noted 
that similar ideas about a opt (h,v,V) and a dependence 
of r a on the motion state had been discussed already in 
Ref. 0. (See also |1| for a discussion.) 

Here, r is the characteristic time of the velocity varia- 
tions and the constant < 1. The limit g^ <C 1 deserves 
special attention because it is just the condition that a 
driver, at first, prefers to eliminate the velocity differ- 
ence v — V between her car and the car ahead and only 
then optimizes the headway. In this case the optimal dy- 
namics of car motion, i.e., the car dynamics governed by 
the relation a — a opt (h,v,V) is a pure fading relaxation 
towards the stationary state. Conversely, the model un- 
der consideration predicts complex oscillations in the car 
motion. Note, that the adopted assumption about the 
value of the coefficient g^ can be justified by applying to 
the general principles of the car motion . 

If the car motion state is far from equilibrium the ne- 
cessity for correcting the velocity and headway distance 
is obvious. In this case it is natural to suppose that 
the characteristic time interval r a between sequential at- 
tempts to correct the car motion should be compara- 
ble to r which characterizes the velocity variations, i.e., 
r a ~ r/g v . Here, g v > 1 is an additional model parame- 
ter. When the car motion comes close to the equilibrium 
and the inequality \a op t(h, v,V)\ < a c is fulfilled the un- 
certainty a ln d(t + To) in evaluating the optimal acceler- 
ation becomes significant. Under such conditions there 
is no reason for the driver to affect the car motion and 
she may not correct it at all. It means that the car mo- 
tion control is depressed and, correspondingly, the cor- 
rection time interval r a grows dramatically inside a do- 
main Q u of the phase plane {h, v, V} where the inequality 
\a opt (h,V, V)\ < a c holds. 

To compute the function T a (h, v, V, a), the boundary 
of the domain Q„ has to be analyzed. Note, that the 
acceleration itself enters the driver's perception of mo- 
tion quality: without any reason, a driver prefers not 
to accelerate at all. When the car motion control is ac- 
tive the estimate a ~ a/r a by virtue of Eq. (|l]) can be 
adopted. So, the boundary of the domain Q u is specified 
by a,o pt (h, v, V) + /i 2 a 2 ~ a 2 ., where \i ~ 1 is a certain co- 
efficient about unity. Assuming the variables h, v, a to be 
independent of one another inside Q u and averaging the 
latter expression over Q u its boundary <&(/i, v, V, a) ~ 1 
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FIG. 3: The correction frequency l/r a of car motion control 
as function of the car motion quality &(h,v, V,a). 



can be derived: 
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If v, V, a) <C 1 the driver activity in correcting 
the car motion is depressed completely. Otherwise, 
<&(h, v,a) ^> 1, the driver controls the car motion ac- 
tively. This is described by the dependence of the cor- 
rection time interval r a on the car motion state, 

-=g v {l(<f>(h,v,V,a))-. (5) 

The form of the function Q (x) is illustrated in Fig. H. 
Equation (Q) together with expressions (^), (Q), and (g) 
form the proposed car following model with bounded ra- 
tional drivers. 

When gh > g v tt(0) the stationary motion with v = V 
and h = hy is unstable, leading to non-damped but 
bounded oscillations in the headway and velocity of the 
following car. The particular form of the function O (x) 
is of minor importance, it is only necessary that its 
value inside Q„ to be small in comparison with the ra- 
tio gh/gv When analyzing the model numerically the 
following ansatz 

ft(x) = exp[(x - l)/A]/(exp[(> - 1)/A] + 1) 

is used, with the parameter A ~ 0.2. Below, numerical 
results will be presented that demonstrate the character- 
istic properties of the developed model. 

Figure |] displays an example of this dynamic in 
the hv-ph&se plane for the dimensionless headway x = 
(h — hy) I \a c T 2 ) and the relative car velocity u = (v — 
V)/(a c r). As seen in Fig. the behavior of this model 
is qualitatively similar to the empirical data in Fig. [I]. 

Preliminary results have shown that, first, the quasi- 
period of these oscillations in the car velocity is equal to 
t times a numerical factor (about ten) depending weakly 
on the model parameters. For t~1s this period is sim- 
ilar to the observed quasi-period. Second, the amplitude 
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FIG. 4: Simulated car-following behavior. Integration of the 
stochastic differential e qua tion has been performed with the 
algorithms described in p6|. The parameters used are g v — 5, 
g h = 0.2, 11 = 1, and A = 0.2. 



of velocity oscillations does not change substantially as 
the model parameters vary and is about a c r. By visu- 
ally comparing Figs, [ij ^ the estimate a c ~ 0.3 m/s 2 is 
obtained. It should be noted that the amplitude of the 
acceleration oscillations exceeds a c by a numerical fac- 
tor about three. Third, the amplitude of the headway 
super-oscillations, in contrast, depends essentially on the 
parameter g v , enabling one to fix this parameter based 
on experimental data. 



SUMMARY 

A model regarding the bounded rational behavior of 
car drivers has been supposed in this contribution. It 
takes into account that drivers, although having de- 
tailed ideas about their preferred driving strategy, are 
not able to control this driving strategy sufficiently pre- 
cisely. Namely, drivers introduce three main sources of 
error into the optimal driving strategy: instead of keeping 
track of the changes in acceleration they simply choose 
a constant one, that additionally is not the optimal one 
but blurred by noise. This noise models the inability of 
drivers to evaluate exactly the very complex integrations 
leading to an optimal driving strategy. Therefore, the 
need to correct the motion from time to time arises, with 
the correction time intervals distributed randomly but 
inversely proportional to the deviation from the desired 
optimal acceleration. 

It is shown, that these ideas can be captured in a simple 
model for the car-following dynamics, however at the cost 
of introducing a non-Newtonian term, the jerk (change in 
acceleration) . The benefit of doing so is that the resulting 
model has smooth trajectories in headway, velocity and 
acceleration but still being a stochastic one. This discerns 
the approach proposed here from almost all models of 
car-following introduced so far. 

Although the trajectories generated by this model have 



some similarities with real car-following data, the ap- 
proach proposed here still needs thorough testing with 
empirical data. This will be done in the near future and 
will be reported soon. 
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